IDIAP Higher-Order Statistics in Visual Object Recognition

Author

  • Thomas M. Breuel
Abstract

In this paper, we develop a higher-order statistical theory of matching models against images. The basic idea is not only to take into account how much of an object can be seen in the image, but also what parts of it are jointly present. We show that this additional information can improve the specificity (i.e., reduce the probability of false positive matches) of a recognition algorithm. We demonstrate formally that most commonly used quality-of-match measures employed by recognition algorithms are based on an independence assumption. Using the Minimum Description Length (MDL) principle and a simple scene-description language as a guide, we show that this independence assumption is not satisfied for common scenes, and propose several important higher-order statistical properties of matches that approximate some aspects of these statistical dependencies. We have implemented a recognition system that takes advantage of this additional statistical information and demonstrate its efficacy in comparisons with a standard recognition system based on bounded error matching. We also observe that the existing use of grouping and segmentation methods has significant effects on the performance of recognition systems that are similar to those resulting from the use of higher-order statistical information. Our analysis provides a statistical framework in which to understand the effects of grouping and segmentation on recognition and suggests ways to take better advantage of such information.

Figure 1: Standard recognition algorithms work well for objects with well-defined geometries (a), but fail to recognize even simple natural objects (Snoopy's head, b).
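The distinction the abstract draws between how much of an object is seen and which parts are jointly present can be illustrated with a small sketch. This is not the paper's system: the function names, the point-feature model, the error bound `eps`, and the choice of "adjacent matched pairs" as a higher-order statistic are all assumptions made here for illustration only.

```python
import math

def bounded_error_matches(model_pts, image_pts, eps):
    """For each model point, report whether some image point lies within eps."""
    return [
        any(math.hypot(mx - ix, my - iy) <= eps for ix, iy in image_pts)
        for mx, my in model_pts
    ]

def count_score(matched):
    """First-order measure: how many model features are matched.

    Treats every feature as independent evidence."""
    return sum(matched)

def adjacent_pair_score(matched):
    """Hypothetical higher-order measure: how many neighbouring model
    features (in model order) are matched together."""
    return sum(1 for a, b in zip(matched, matched[1:]) if a and b)

# Hypothetical model: six features along a contour.
model = [(0, 0), (1, 0), (2, 0), (3, 0), (4, 0), (5, 0)]
# Two hypothetical images that each match exactly three model features.
contiguous_image = [(0, 0.1), (1, -0.1), (2, 0.05)]
scattered_image = [(0, 0.1), (2, -0.1), (4, 0.05)]

contiguous = bounded_error_matches(model, contiguous_image, eps=0.2)
scattered = bounded_error_matches(model, scattered_image, eps=0.2)

print(count_score(contiguous), count_score(scattered))
print(adjacent_pair_score(contiguous), adjacent_pair_score(scattered))
```

Both images receive the same count-based score, so a measure built on the independence assumption cannot tell them apart. The pairwise statistic separates the match in which neighbouring features appear together from the one in which the matched features are scattered, which is the kind of evidence a random clutter match is less likely to produce.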


Similar resources

Idiap Higher-order Statistics in Visual Object Recognition

In this paper, we develop a higher-order statistical theory of matching models against images. The basic idea is not only to take into account how much of an object can be seen in the image, but also what parts of it are jointly present. We show that this additional information can improve the specificity (i.e., reduce the probability of false positive matches) of a recognition algorithm. We demonstrate ...


Visual Tracking using Learning Histogram of Oriented Gradients by SVM on Mobile Robot

The intelligence of a mobile robot is highly dependent on its vision. The main objective of an intelligent mobile robot lies in its ability to perform online image processing, object detection, and especially visual tracking, which is a complex task in stochastic environments. Tracking algorithms suffer from sequence challenges such as illumination variation, occlusion, and background clutter, so an a...


Speech & face based biometric authentication at IDIAP

We present an overview of recent research at IDIAP on speech & face based biometric authentication. This paper covers user-customised passwords, adaptation techniques, confidence measures (for use in fusion of audio & visual scores), face verification in difficult image conditions, as well as other related research issues. We also overview the open source Torch library, which has aided in the im...


Idiap at MediaEval 2013: Search and Hyperlinking Task

The Idiap system for search and hyperlinking uses topic-based segmentation, content-based recommendation algorithms, and multimodal re-ranking. For both sub-tasks, our system performs better with automatic speech recognition output than with manual subtitles. For linking, the results benefit from the fusion of text and visual concepts detected in the anchors.


Idiap Recognition of Handprinted Digits Using Optimal Bounded Error Matching

This paper describes a system that recognizes hand-printed digits. The system is based on optimal bounded error matching, a technique already in common use in general-purpose 2D and 3D visual object recognition systems in cluttered, noisy scenes. In this paper, we demonstrate that the same techniques achieve high recognition rates (up to 99.2%) on real-world data (the NIST database of hand-prin...



Publication date: 1993